Optical conductivity of wet DNA 
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Motivated by recent experiments we study the optical conductivity of DNA in its natural envi- 
ronment containing water molecules and counter ions. Our density functional theory calculations 
(using SIESTA) for four base pair B-DNA with order 250 surrounding water molecules suggest a 
thermally activated doping of the DNA by water states which generically leads to an electronic 
contribution to low-frequency absorption. The main contributions to the doping result from water 
near DNA ends, breaks, or nicks and are thus potentially associated with temporal or structural 
defects in the DNA. 

PACS numbers: 87.14.Gg, 87.15. Aa, 87.15.Mi 



The electronic properties of DNA have received consid- 
erable scientific attention in the last few years, motivated 
both by possible use in molecular electronics and by 
speculation that electronic processes can speed up the 
location (and possibly repair) of damage sites in living 
cells 13 • A number of experiments have studied conduc- 
tance of DNA in different contexts (for a recent review 
see Ref. 0). Of particular note have been observations 
of activated behavior with small gaps, of order 0.1-0.3 
eV 0, 13, which stand in contrast to the peak optical 
absorption of DNA in the ultraviolet at 3.5-4 eV. Evi- 
dently, such small activation energies, if not extrinsically 
induced, require an intrinsic doping mechanism. 

To address this issue, we study several DNA tetramers 
surrounded by waters and counterions using a combina- 
tion of classical molecular dynamics (MD) with density 
functional theory (DFT) employing the local basis set 
SIESTA code 0. To our knowledge, this is the first time 
that the electronic conductivity of DNA has been com- 
puted with a full quantum mechanical treatment of wa- 
ter and counterions in a dynamically fluctuating environ- 
ment, although similar methods have been used before 
to study charge migration in DNA 0, Q without explicit 
quantum treatment of the environment. Our tetramers 
are in the fully hydrated biological form (B-DNA), with 
order 250 water molecules. We find evidence that waters 
in contact with DNA bases can dope the DNA with ex- 
citation gaps as small as 0.1-0.3 cV. Wc find absorption 
at low frequencies in agreement with optical experiments 
if we increase the number of bases exposed to water. 
In a cellular environment, these will be associated with 
temporal or structural defects, but in many solid-state 
experiments can also arise from the fiattening and un- 
winding of the DNA helix. 

We also study the peak absorption in DNA in the ul- 
traviolet at 3.5-4 eV. The calculated frequency depen- 
dent conductivity agrees well with the measured peak 
absorption for A-phage DNA|^. While the peak loca- 
tion is found to be stable, there are distinct features near 
the peak, which vary with DNA sequence and different 



environmental configurations accessed during a typical 
MD run. Within the time scales of the MD (out to 1 
ns), our calculations are unable to uniquely determine 
sequence specificity, but some specificity /fingerprinting 
may be possible with signal averaging. 

These results confirm that while the DNA is a large- 
gap material, environmental conditions can induce states 
in the gap, which can in turn lead to a small but non-zero 
sub-gap absorption. Such states are also crucial for un- 
derstanding room temperature thermally activated elec- 
tron dynamics in the DNA, a mechanism that has gained 
wide acceptance for observed long-range electron transfer 
in DNA 
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Electronic structure calculations have previously been 
performed for dry A-DNA Q and for crystallized Z-DNA 
[lO| where the effects of solvent and counterions were 
also addressed. Here, we extend our recent work by 
considering dynamical fluctuations of the environment. 
Other efforts have been made to study electronic spectra 
of DNA in fluctuating environments both without 
and with 0, counterions and water. Our work differs 
from these latter efforts by the explicit quantum mechan- 
ical treatment of water and counterion states. 

To calculate the electronic spectra and optical conduc- 
tivity we have used the fully ab initio DFT code SIESTA 
0. It uses TrouUier-Martins norm-conservin g p seudo 
potentials ^3 in the Kleinman-Bylander form |l4j. We 
have used the generalized gradient approximation (GGA) 
for the exchange-correlation energy functional in the ver- 
sion of Perdew, Burke, and Ernzerhof . SIESTA uses 
a basis set of numerical atomic orbitals where the method 
by Sankey and Niklewski is employed. For the DNA 
we have used a double-C basis set except for phosphorus 
and the counterions for which the polarization orbitals 
are also included. For the surrounding water molecules 
(about 250) we have only used the minimal, single-C basis 
set. 
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The optical conductivity is given by the Kubo formula 
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and depends on the frequency uj. Here, £„, Sm and |n), 
m) are the energies and wave functions of the DFT 
Hamiltonian, respectively. V denotes the unit cell vol- 
ume, and with ^ =±, || is the component of the current 
operator perpendicular or parallel to the DNA strand di- 
rection. The conductivity computed here does not in- 
clude time dependent DFT or scissors corrections. 

The solvated DNA structures were obtained from clas- 
sical MD simulations with the AMBER7 package 
(parm98 force field). The four base-pair long DNA 
structures (B-DNA 5'-GAAT-3', 5'-GGGG-3', 5'-AAAA- 
3', and TT-dimer) were initially charge-neutralized with 
counterions (either by 6 Na"*" or 3 Mg^+) and solvated 
with about 600 TIP3P water molecules. A simulation 
box of dimensions 38x35.4x25A'^ (z-axis parallel to DNA 
axis) with periodic boundary conditions was applied. Af- 
ter equilibration at room temperature for 1 ns, the trajec- 
tory was recorded every 10 ps over a simulation time of 1 
ns. The resulting 100 snapshots were used for subsequent 
analysis. All simulations employed the SHAKE method 
to fix hydrogen-heavy atom distances allowing a 2 fs in- 
tegration time step. The cut-off for long-range interac- 
tions was set to 10 A. TT-dimer (segment 5'-ATTA-3' 
from PDB code 1SM5) was further constrained using the 
BELLY option to preserve the distorted DNA structure. 

To use the structures from MD simulations in DFT 
calculations we had to reduce the number of atoms. We 
only kept the first and second solvation shell (approx- 
imately 30 water molecules per nucleotide) so that all 
water molecules within a 4.7 A radius were included. We 
incorporated about 1000 atoms in our DFT calculation 
so that the computations were quite time consuming. In 
particular, the evaluation of the optical conductivity was 
very expensive. To speed up the computations we re- 
stricted the number of included unoccupied states which 
only affect the high-energy spectra above 5 eV. The calcu- 
lation of optical conductivity for a given structure snap- 
shot took about 5 days on AMD's Opteron CPU where 
approximately 2 GB RAM was needed. All calculations 
were done at room temperature. 

In order to identify the energetically important states 
near the Fermi energy, we projected the density of states 
(DOS) on the atomic Pz orbitals of the base pairs and on 
the water molecules. Wc also calculated the projected 
DOS of the counterions, phosphates, and the sugars. 
However, we only found contributions of the Pz orbitals 
and the water molecules near the Fermi level. A typical 
projected DOS of structure snapshot of a 5'-GAAT-3' se- 
quence with Mg counterions is shown in Fig. ^ As seen 
in panel (a), there is a rather large tt-tt* gap of about 
2.7 eV. However, if we project the DOS on a single base 
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FIG. 1: Typical PDOS of a structure snapshot of a 5'-GAAT- 
3' sequence with Mg counterions. Panel (a) shows the pz 
orbitals of atoms in base pairs. Panel (b) contains the PDOS 
of all water molecules (thin black line) where the contributions 
of water molecules near the sequence ends are plotted by a 
thick red line. The Fermi energy is set to 0.0 eV. 



we would find an intrabase tt-tt* gap of about 3.7 eV. It 
turns out that only such intrabase transitions have large 
dipolc matrix elements. Therefore, we observe optical 
gaps of the same size (compare Figs. |5| and Ol. 

Although a rather big tt-tt* gap is observed the ac- 
tual gap between the highest occupied molecular orbital 
(HOMO) and the lowest unoccupied molecular orbital 
(LUMO) of about 150 meV is very small [combine pan- 
els (a) and (b) of Fig.|2]. Thus, electrons could be excited 
from water states below the Fermi energy into unoccu- 
pied Pz orbitals due to thermal fluctuations. Notice that 
the water states just below the Fermi energy belong to 
water molecules near the ends of the sequence [see panel 
(b) of Fig. nj. Therefore, a small gap between HOMO 
and LUMO only appears if water molecules could enter 
the DNA structure on damage sites like ends, breaks, or 
nicks to contact the DNA bases. 

The results of the projected DOS suggest the possibil- 
ity of a thermally activated doping of the DNA by water 
states which should also lead to an electronic contribu- 
tion to the low-frequency absorption. We indeed observe 
low-energy features in the optical conductivity [panel (a) 
Fig. 12] . By studying the (fictitious) temperature depen- 
dence of these features within SIESTA, we confirm that 
they are due to transitions between thermally occupied 
TT* states of the DNA bases. Furthermore, as shown in 
panel (b), we also observe the pronounced tt-tt* transition 
at 3.8 cV. Whereas the polarization dependence of the 
low-energy absorption is rather small [see panel (a)] the 
in-plane tt-tt* transition is much more visible if the elec- 
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FIG. 2: Polarization dependence of the optical conduc- 
tivity for a structure snapshot of a 5'-GAAT-3' sequence 
with Na counterions where cri^x(i^) [''"1,11 (i^)] is plotted with 
solid [dashed] lines. Panel (a) shows the low-energy features 
whereas panel (b) presents the range of the tt-tv* transition. 
The theoretical line spectra are broadened with Gaussian 
functions of width 0.1 eV. 
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FIG. 3: Robustness of the tt-tv* transition regarding environ- 
ment and DNA sequence. Panel (a) shows the effects of the 
surrounding water molecules and counterions on the optical 
conductivity of a 5'-GAAT-3' sequence. The results for dif- 
ferent B-DNA sequences in a wet environment containing Mg 
counterions are compared in panel (b). The spectra for the 
wet 5'-GAAT-3' (5'-AAAA-3', 5'-GGGG-3', and TT-dimer) 
sequences have been averaged over the results of 9 (5) struc- 
ture snap shots. The theoretical line spectra are broadened 
with Gaussian functions of width 0.1 eV. 



trie field is perpendieular to the DNA strand direetion. 
A less pronounced anisotro py o f the optieal conduetivity 
has also been found in Ref. llOl. 

To take into account the dynamical character of the 
environment we average the results for the optical con- 
ductivity over up to 9 structure snapshots from the MD 
simulations. As discussed above, the low-energy features 
of the optical conductivity result from a doping of the 
DNA by water states. Therefore, they are strongly af- 
fected by environment fluctuations and we only obtain 
some smeared intensity in the low-energy range of the av- 
eraged conductivity. In contrast, the intrinsic tt-tt* gap 



is much less affected by dynamical fluctuations leading 
to well defined structures in that range. 

Due to the intrinsic character of the tt-tt* transition 
the averaged optical conductivity in the range 2 — 5 eV is 
barely affected by different counterions [see panel (a) of 
Fig. |S] . To show that the main structure in the spectra 
around 3.7 eV really has to be interpreted as the intra- 
base TT-TT* transition we also plot the optical conductivity 
of dry protonated B-DNA in panel (a) of Fig.|2| Although 
the spectra of wet and dry B-DNA considerably differ in 
the intensities, the peak positions agree quite well. Fur- 
thermore, as seen from panel (b) of Fig.O peak position 
and shape of the tt-tt* transition depend only weakly on 
sequence whereas remarkable differences are observed in 
the range 4 — 5 eV. 

In Fig.^lwe compare our results with recent optical ex- 
periments Q and find a nice agreement for the range of 
the TT-TT* transition above 10000 cm~^. However, we find 
pronounced polarization dependence of the conductivity 
(see Fig. whereas the experimental spectra are almost 
isotropic. The latter can be explained by substantial vari- 
ations in the axis of the macroscopically orientated DNA 
duplex 0. 

The spectra below 500 cm~^ arc also affected by vibra- 
tional modes of the double helix structure and the sur- 
rounding water molecules, which are not included in our 
DFT calculations. In particular, the optical conductivity 
below 100 cm^^ has been interpreted in Ref. ITsI as a pure 
water dipole relaxation. However, our DFT calculations 
clearly show an electronic conductivity even at very low 
frequencies (see Fig.QJ. As already discussed above, the 
low-frequency conductivity results from a doping of the 
DNA by water molecules near the bases. Therefore, the 
theoretically observed conductivity at low frequencies, 
which is approximately an order of magnitude smaller 
than the experimental values, would be further increased 
if more waters could enter the DNA structure on addi- 
tional damage sites like ends, breaks, or nicks. 

A striking aspect of the work of Briman et al. is 
the near identity of the microwave conductivity for sin- 
gle stranded and double stranded DNA This, as 
well as the order of magnitude discrepancy between our 
low frequency calculations and their data can possibly 
be resolved as follows: (i) We have order one water per 
tetramer associated with end bases; if this is upped to 
one level per base we gain a factor of eight in absorption 
intensity, close to that needed to resolve the discrepancy 
with the Briman et al. data. This is plausible for sin- 
gle stranded DNA adsorbed to the sapphire substrate of 
Ref. m (ii) To get the same low frequency conductivity 
contribution for double stranded DNA requires that the 
DNA flatten and unwind (the helix) as has been observed 
elsewhere for DNA on surfaces jl^]. In the unwound 
configuration the extra space between bases allows for 
entry by water molecules. Obviously some of the mi- 
crowave absorption will be caused by water motion alone 
as suggested by Ref. ^2 our purpose here is to note that 
electronic absorption affiliated with defect states can be 
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FIG. 4: Panel (a) is a log-log plot of the optical conductivity 
from Fig. 13 panel (a). Panel (b) shows experimental data for 
DNA in a 5% and 95% relative humidity environment taken 
from Ref. H 



comparable. 

A grand technological challenge of our times is to de- 
velop a rapid scheme for sequence differentiation in DNA. 
for example to be able to differentiate between alleles 
of a given gene. Current technologies, such as those of 
real-time or quantitative PGR, rely on fluorescent dyes 
and the FRET (Fluorescent Resonance Energy Transfer) 
method. This requires developing gene specific labeling 
dyes, or donors and acceptors, which will lead to fluo- 
rescent signals reflecting the amount of a given double- 
stranded DNA sequence in the system. However, the in- 
trinsic electronic [2l| and optical properties of DNA also 
vary with sequence. So, a natural question is: would it 
be possible to develop label-free optical fingerprinting of 
DNA that would allow one to infer different alleles di- 
rectly from optical experiments without the use of dyes? 

Optical fingerprinting would require two things to hap- 
pen. First, one would need optical signatures predomi- 
nantly from a given gene and one would need signals 



from different alleles to be sufficiently different. The for- 
mer may be achieved by gene-chip and by surface en- 
hancement techniques. We can address here the latter 
question, how different are the spectra from different se- 
quences and consequently how easy would it be to estab- 
lish such a difference experimentally. Clearly the spectra 
from different sequences strongly overlap and this makes 
it harder to distinguish them. Having large copies of the 
gene generated through polymerase chain reaction will 
lead to signal averaging and the time averaged signal can 
be distinguished if there is sufficient accuracy. For ex- 
ample, we have about 30% statistical variance for five 
configurations used in averaged conductivity calculations 
of tetramers while the means diverge at the w 5% level. 
Hence, we could expect single point frequency sampling 
to discern the sequences here with a configuration in- 
crease to 180. This number could likely be reduced with 
judicious use of multipoint frequency sampling. If a sim- 
ple definitive fingerprint is required without full sequence 
information, as might be appropriate in applications in 
forensics or pathogen detection, numbers in the ballpark 
of 100-1000 clones of a given sequence could be suffi- 
cient. This compares with the 10^ — 10^ levels of clones 
necessary for current state of the art DNA sequencing 
technology poj. Quantitative theoretical studies will be 
an important aid in developing such technologies. 
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